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The estimation of the extremal dependence structure is spoiled 
by the impact of the bias, which increases with the number of obser¬ 
vations used for the estimation. Already known in the univariate set¬ 
ting, the bias correction procedure is studied in this paper under the 
multivariate framework. New families of estimators of the stable tail 
dependence function are obtained. They are asymptotically unbiased 
versions of the empirical estimator introduced by Huang [Statistics of 
bivariate extremes (1992) Erasmus Univ.]. Since the new estimators 
have a regular behavior with respect to the number of observations, 
it is possible to deduce aggregated versions so that the choice of the 
threshold is substantially simplified. An extensive simulation study 
is provided as well as an application on real data. 

1. Introduction. Estimating extreme risks in a multivariate framework is 
highly connected with the estimation of the extremal dependence structure. 
This structure can be described via the stable tail dependence function 
(s.t.d.f.) L, first introduced by Huang (1992). For any arbitrary dimension 
d, consider a multivariate vector with continuous marginal 

cumulative distribution functions (c.d.f.) Fi,...,Fd. The s.t.d.f. is defined 
for each positive reals xi,... ,Xd as 

lim t ^xi or ... or 1 — Fd{X^^'^) <t-\d] 

t—^oo 

= L{xi,...,Xd). 

Assuming that such a limit exists and is nondegenerate is equivalent to the 
classical assumption of existence of a multivariate domain of attraction for 
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the componentwise maxima; see, for example, de Haan and Ferreira (2006), 
Chapter 7. The previous limit can be rewritten as 

(1) lim t[l-F{F{^{l-t~^xi),...,F~^{l-r^Xd)}] = L{xi,...,Xd), 


where F denotes the multivariate c.d.f. of the vector and 

for j = 1,...,d, F~^{t) = infjz G M: Fj{z) > t}. Consider a sample of size n 
drawn from F and an intermediate sequence, that is to say a sequence k = 
k{n) tending to infinity as n —oo, with /c/n —)• 0. Denote by x = (xi, ..., Xd) 
a vector of the positive quadrant = {(xi,. .., Xd ): Xj > 0, j = 1,..., d} 

and by ^ the kih order statistics among n realizations of the margins 
X^-^^ The empirical estimator of T(x) is obtained from (1), replacing F by 
its empirical version, t by n/k, and — t~^Xj) for j = 1,..., d by its 

empirical counterpart ■] n’ 


( 2 ) 




2=1 




or 


i — n — [kx^]-\-l,n 


r 


See Huang (1992) for pioneering works on this estimator. Under suitable 
conditions, it can be shown (see Section 2) that the estimator Tfc(x) has the 
following asymptotic expansion: 

(3) Lfc(x)-L(x) « + a(n//c)M(x), 

where is a continuous centered Gaussian process, a is a function that 
tends to 0 at infinity and M is a continuous function. In particular 
Vk{Lk (x) — T(x)} can be approximated in distribution by Zl{x.), provided 
that Vka{n/k) tends to 0 as n tends to infinity. This condition imposes a 
slow rate of convergence of the estimator Lk{x), so one would be interested 
in relaxing this hypothesis. As a counterpart, as soon as y/ka{n/k) tends 
to a nonnull constant A, an asymptotic bias appears and is explicitely given 
by AM(x). The aim of this paper is to provide a procedure that reduces 
the asymptotic bias. The latter will be estimated and then subtracted from 
the empirical estimator. This kind of approach has been considered in the 
univariate setting for the bias correction of the extreme value index with 
unknown sign by Cai, de Haan and Zhou (2013). Refer also to Peng (1998, 
2010) Fraga Alves, de Haan and Lin (2003), Gomes, de Haan and Rodrigues 
(2008) and Gaeiro, Gomes and Rodrigues (2009) for previous contributions 
on this problem. Note finally that the case of dependent sequences has been 
recently studied by de Haan, Mercadier and Zhou (2014). 

The nonparametric estimation of the extremal dependence structure has 
been widely studied in the bivariate case; see, for instance, Huang (1992), 



BIAS CORRECTION IN MULTIVARIATE EXTREMES 


3 


Einmahl, de Haan and Sinha (1997), Caperaa and Fougeres (2000), Abdous 
and Ghoudi (2005), Guillotte, Perron and Segers (2011) and Bucher, Dette 
and Volgushev (2011). Bias correction problems in the bivariate context 
received less attention than in the univariate setting. To the best of our 
knowledge, it seems to be reduced to Beirlant, Dierckx and Guillou (2011) 
and Goegebeur and Guillou (2013), who consider the estimation of bivariate 
joint tails, which differs slightly from our task. 

As for the multivariate framework, de Haan and Resnick (1993) intro¬ 
duces the empirical estimator. General approaches under parametric as¬ 
sumptions on the function L have been developed, for example, by Goles and 
Tawn (1991), Joe, Smith and Weissman (1992), Einmahl, Krajina and Segers 
(2008) and Einmahl, Krajina and Segers (2012). Apparently, no procedure 
correcting the bias can be found in the literature for dimension greater than 
two. The objective of this article is to fill this gap. Note that our method 
does not only consist of applying the univariate bias procedure at several 
points. Indeed, the bias is no longer a parametric function, so that the new 
feature is mainly the fact that we are able to estimate and then subtract 
a function with an unknown form. Two families of asymptotically unbiased 
estimators of the s.t.d.f. are proposed, and their theoretical behaviors are 
studied. A practical advantage of these new estimators is that they can be 
aggregated, thus reducing the variability. 

The paper is organized as follows: Section 2 contains hypotheses and first 
results. The bias reduction procedure is described in Section 3, and the 
main theoretical results are presented therein. Several theoretical models 
are exhibited in Section 4 that satisfy the required assumptions. Section 5 
illustrates the performance of the new estimators on both simulated and real 
data. The estimation of side components is postponed up to Section 6. The 
proofs are relegated to Section 8. 

2. Notation, assumptions and first results. Let Xi = (Af\...,A{'^^), 

... ,X„ = (Ai^\ ... be independent and identically distributed mul¬ 

tivariate random vectors with c.d.f. F and continuous marginal c.d.f.’s Fj 
for j = Suppose F is in the domain of attraction of an extreme 

value distribution with c.d.f. G. We recall that it supposes the existence for 
j = 1,... ,d of sequences > 0, bn' of real numbers and a c.d.f. G with 
nondegenerate marginals such that 

lim P(max{Aj^\ ..., A^} < a^xi -L 6^,..., 

n^oo 

max{A{'^\ .. < a^^'^Xd + b^n^) = G(x) 

for all points x where G is continuous. Denote by Gj the jth marginal c.d.f. 
of G. It is possible to show that the domain of attraction condition can 
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be expressed as condition (1) along with the convergence of the marginal 
distributions to the G/s, and that 

(4) L(x) = -logG({-logGi}"^(xi),...,{-logGd}"^(xd)). 

Let /r be the measure defined by 

(5) /i{^(x)} :=L(x), 

where ^(x) := {u G there exists j such that Uj > xj} for any vector 
X G M+. 

Several conditions are now described. The first two have been introduced 
by de Haan and Resnick (1993): 

- The first-order condition consists of assuming that the limit given in (1) 

exists, and that the convergence is uniform on any for T > 0. 

- The second-order condition consists of assuming the existence of a positive 
function ce, such that a{t) —)• 0 as t —>■ oo, and a nonnull function M such 
that for all x with positive coordinates, 

lim -J^{t[l - T{Ff ^(1 - r^xi ),..., - t-^Xd)}] - L(x)} 

(6) 

= M(x), 

uniformly on any [0,T]'^, for T > 0. 

- The third-order condition consists of assuming the existence of a positive 
function /3, such that fi{t) —^ 0 as t ^ oo, and a nonnull function N such 
that for all x with positive coordinates, 

lim [ ^[1 ~ ■ ■ ■, F-^{1 - - L(x) 

I3(t) 1 a(t) 

(7) 

= iV(x), 

uniformly on any [0,T]'^, for T > 0. This implicitly requires that N is not 
a multiple of the function M; see Remark 2. 

Remark 1. The function L defined by (1) and that appears in (6) 
and (7) is homogeneous of order 1. We refer, for instance, to de Haan and 
Ferreira (2006), pages 213 and 236. Most of the estimators constructed in 
this paper use the homogeneity property. Note that pointwise convergence 
in (1) entails uniform convergence on the square [0,r]'^. See, for instance, 
de Haan and Ferreira (2006), page 237. 
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Remark 2. li N = c - M for some constant c, the relation can be refor¬ 
mulated as 


1 

t^m\ a{t){l + c/3{t)) 

= 0 , 


M(x) 


which we want to exclude. We refer to de Haan and Ferreira [(2006), page 385], 
to see that the same complication turns up in the one-dimensional case. 


Remark 3. The functions M and N involved in the second and third- 
order conditions satisfy some usual properties; see, for example, de Haan and 
Resnick (1993). More specifically, one can show that there exist nonpositive 
reals p and p' such that a (resp., /3) is a regularly varying function of order p 
(resp., p'), that is, a{tz)/a{t) when t —)• oo, for each positive z. Besides, 

M is homogeneous of order 1 — p, that is to say M{rx.) = r^“^M(x), for 
each positive r and x with positive coordinates. Finally, the function N is 
homogeneous of order 1 — p — p'. 

Remark 4. An interesting situation to consider is when the c.d.f. F is 
in the domain of attraction of an extreme value distribution G with inde¬ 
pendent components, that is, G = 0^=1 Gj. Such a c.d.f. is said to have the 
property of asymptotic independence. In this case, the function M is the 
limit of the joint tail of the distribution, and in dimension 2, the coefficient 
of tail dependence p introduced by Ledford and Tawn (1996, 1997) equals 
1/(1 — p), where p is defined in Remark 3. 

In this paper, we will handle two sets of assumptions. First consider: 

(A2) - the second-order condition is satished, so that (6) holds; 

- the coefficient of regular variation p of the function a defined in (6) 
is negative; 

- the function M defined in (6) is continuous. 

These hypotheses allow us to get the asymptotic uniform behavior of L^, 
the empirical estimator of L defined by (2), as detailed in the following 
proposition. 


Proposition 1. Let Xi,...,X„ be independent multivariate random 
vectors in with common joint c.d.f. F and continuous marginal c.d.f.’s 
Fj for j = l,...,d. Assume that the set of conditions (A2) holds. Sup¬ 
pose further that the first-order partial derivatives of L (denoted by djL 
for j = 1,... ,d) exist and that djL is continuous on the set of points {x = 
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>0}. Consider the estimator of L defined by (2) 
where k is such that ^/ka{n/k) —)■ oo. Then as n tends to infinity, we get 

1 


sup 

0<a:i,...,a;d<T 


a{n/k) 


{Lfc(x) -L(x)} -M(x) 


0 . 


Under stronger assumptions, and for some choice of the intermediate se¬ 
quence, the asymptotic distribution of the previous stochastic process can 
be obtained after multiplication by the rate y/ka{n/k). For a positive T, 
let D{[0,T]'^) be the space of real valued functions that are right-continuous 
with left-limits. Now introduce the conditions: 

(A3) - the third-order condition is satisfied, so that (6) and (7) hold; 

- the coefficients of regular variation p and p' of the functions a and 
j3 defined in (6) and (7) are negative; 

- the function M defined in (6) is differentiable and N defined in (7) 
is continuous. 


Proposition 2. Assume that the conditions of Proposition 1 are ful¬ 
filled and that the set of conditions (A3) hold. Consider the estimator of L 
defined by (2) where k is such that Vka{n/k) —)• oo and Vka{n/k)(i{n/k) —>• 
0. Then as n tends to infinity, 

M(x)|4Zi(x), 

in D{[0,T]'^) for every T >0, where 


( 8 ) 


Vfc|Lfe(x) - L(x) - 


(9) 


d 


Zl(x) := 1Fl(x) - '^WL{xjej)djL{yL). 
i=i 


The process Wl above is a continuous centered Gaussian process with co- 
variance structure E[Wi(x)VFL(y)] = p{R{x.) n R{y)} given in terms of the 
measure p defined by (5) and of i7(x) = {u G there exists j such that 

0 <Uj < Xj}. 


Remark 5. A difference between the previous result and Theorem 7.2.2 
of de Haan and Ferreira (2006) consists of the choice of the intermediate se¬ 
quence that is larger here. Indeed, we suppose \Vkoi{n/k)\ —)• oo whereas 
they choose k{n) = which implies ^/ka{n/k) —)• 0. Our choice 

requires the more informative second-order condition (6). A nonnull asymp¬ 
totic bias appears in our framework. 

Remark 6. The conditions onk, a and /? required in Proposition 2 are 
not too restrictive: because of the regular variation of a and j3, they are 
implied by the choice k{n) = n^, with k G (— 







BIAS CORRECTION IN MULTIVARIATE EXTREMES 


7 


3. Bias reduction procedure. As pointed out in Remark 5, a nonnull 
asymptotic bias a{n/k)M{x.) appears from Proposition 2. The bias reduc¬ 
tion procedure will consist of subtracting the estimated asymptotic bias 
obtained in Section 3.1. The key ingredient is the homogeneity of the func¬ 
tions L and M mentioned in Remarks 1 and 3. This homogeneity will also 
provide other constructions to get rid of the asymptotic bias. 

3.1. Estimation of the asymptotic bias of L^. Equation (8) suggests a 
natural correction of as soon as an estimator of a{n/k)M(x.) is available. 
In order to take advantage of the homogeneity of L, let us introduce a 
positive scale parameter a which allows to contract or to dilate the observed 
points. We denote 


(10) 

^/c,a(x):=a ^Lkiax) 

and 


(11) 



From (8) one gets 

a“^M(x)| -A a~^ZLiax), 

in Z1([0,T]'^) for every T > 0. Equations (11) and Proposition 1 yield as n 
tends to infinity, 

(13) A(a-0-l)M(x). 

a[n/k) 

Fixing a such that a~P — 1 = 1, a natural estimator of the asymptotic bias 
of Tfc(x) is thus A^ 2 -i/p(^)) where p is an estimator of p. Recall that the 
unknown parameter p is the regular variation index of the function a in¬ 
volved in the-second order condition. Let kp be an intermediate sequence 
that represents the number of order statistics used in the estimator p. As¬ 
sume that kp^ k where k = k{n) is the sequence used in Proposition 2. A 
first asymptotically unbiased estimator of L(x) can be dehned as 

O /V 

(14) Lfc,i,fcp(x) := Lfc(x) - Afc 2-i/p(x). 

The asymptotic behavior of this estimator is provided in Theorem 3 and 
Remark 8. We refer the reader to Section 6 for more details concerning the 
estimation of p. 


(12) v^|Lfc,a(x) - L(x) - 
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3.2. Estimation of the asymptotic bias of L^ a- The previous construc¬ 
tion can be easily generalized by correcting the estimator „ instead of 
Lfc. Indeed, from (12) one can see that the asymptotic bias of Lfc^a(x) is 
a(^)a“^M(x). Recall that when n tends to infinity, one has for any positive 
real 6, 


a{n/k) 


l)M(x). 


Thus fixing b such that b ^ — 1 = a p will help to cancel the asymptotic 
bias. It yields the following asymptotically unbiased estimator of L: 

O /S /X 

(15) Lk^a,kp{^) ■= Lk,a{^) ~ ■ 


Theorem 3. Assume that the conditions of Proposition 2 are fulfilled, 
and consider the estimator of L defined by (15). Let kp be an intermediate 
sequenee sueh that ^J^a{n/kp){p — p) converges in distribution. Suppose 
also that k is such that k = o{kp), Vka{n/k) —)• oo and y/ka{n/k)(3{n/k) — >• 
0. Under these assumptions, as n tends to infinity, 

( 16 ) \/fc{Lfc,a,fcp(x) - L(x)}-A l)i(x), 

, o 

in T>([0,T]“) for every T > 0, where Ya is a eontinuous centered Gaussian 
process defined by 

Ya(x) := Zl{-k) - b~^ZL{byi) + a~^ZL{ax) 

with covariance K[Ya{x.)Ya{y)] = E[Z 2 ,(x)Zi(y)](l — 6 “^/^-|-andb = 
{a-P + l)-^/p. 


Remark 7. The assumption that ^Jk'pO^{n/kp){p — p) converges in dis¬ 
tribution will be reconsidered in Section 6. 


Remark 8. Theorem 3 remains true when a = 1 and thus characterizes 
the asymptotic behavior of the estimator given in (14). For this particular 
choice of a, the covariance reduces to E[Zi(x)Zi(y)](2 — 2^/^^)^. 

3.3. An alternative estimation of the asymptotie bias of Lk^a- The pro¬ 
cedure of bias reduction introduced in the previous section requires the es¬ 
timation of the second-order parameter p. It is actually possible to avoid 
it, making use of combinations of estimators of L. The asymptotic bias of 
Lfc^a(x) is a(^)a“^M(x), as already noted from (12). Making use of (13) 
and homogeneity of M, one gets as n tends to infinity, 

Afcp,a(Qx) _^ g-P 

Akp,aiax) - aAkp,a{^) - 1 ’ 
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for any intermediate sequence kp that satisfies ■^J^pa{n/kp) —)• oo. The ex¬ 
pression 


can thus be used as an estimator of the asymptotic bias of Lfc^a(x). After 
simplifications, this leads to a new family of asymptotically unbiased esti¬ 
mators of T(x) by substracting the estimated bias from Lfc^a(x), namely 


(17) 


L 


k^a,ki 


,(x) = 


Lfc(x)Afc^^a(Qx) - Lfc(ax)Afcp,a(x) 

Afcp,a(ax) - aAfcp^a(x) 


which is well dehned for any real number a such that 0 < a < 1. 


Theorem 4. Assume that the conditions of Proposition 2 are fulfilled, 
and consider the estimator of L defined by (17). Let kp he an intermediate 
sequence such that ■>J^pa{n/kp){p — p) converges in distribution. Suppose 
also that k is such that k = o{kp), y/ka{n/k) oo, y/k = 0{y^a{n/kp)) 
and '/ka{n/k)j3{n/k) —)• 0. Assume moreover that the function M never 
vanishes except on the axes. Then, as n tends to infinity, 

( 18 ) Vfc{Lfc,a,fcp(x)-L(x)}-Aya(x), 

in D{[e,TY‘) for every e > 0 and T > 0, where Ya is a continuous centered 
Gaussian process with covariance E[ya(x)ya(y)] given by "ElZ l{^)Z^{ y)] x 
{a~P — 

Remark 9. The covariance function specihed above is decreasing with 
respect to the parameter a for any fixed value of p. This suggests at first 
glance to choose a close to 1 in order to reduce the asymptotic variance of 
Ya, but this would give a degenerate form of (17). See Section 5 for practical 
considerations for the choice of a. 


4. Theoretical examples. The aim of this section is to furnish several 
multivariate distributions that satisfy the third-order condition (7). For the 
sake of simplicity, expressions are displayed in the bivariate setting. We start 
by focusing on heavy-tailed margins. In this case, a first possible step to get 
the pointwise convergence is to obtain, for well-chosen positive reals p and 
q, an expansion (for t tending to infinity) of the form 

tF{X > Y’x or y > L^y) 

= Ti{x,y) + a{t)T 2 {x,y) + a{t)l3{t)T2,{x,y) + o{a{t)l3{t)), 
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with Ti(l, 1) > 0. One can then identify each term involved in (7) as follows: 
L{x,y) =Ti{a{x),b{y)), M{x,y) = T 2 {a{x),h{y)) and 


N{x,y) = T 3 {a{x),b{y)), 

where 


a[x) = x ^{ri(l,+oo)}^, b{x)=x ^{ri(+oo, 1)}^- 


Applying Resnick [(1986), Corollary 5.18], one can check that in such a 
framework a form of the bivariate extreme value distribution G is given by 


G{x,y) =exp 


Ti{x,y) \ 


4.1. Powered norm densities. Following the idea of Resnick [(1986), pages 
276 and 286] consider first a norm || • ||, and a cone V of M^, that is to say, 
a set such that if {x,y) G V, then {tx,ty) G V for every positive t. Without 
loss of generality, suppose that (1,1) G V. Let {X,Y) be a bivariate random 
vector with probability density function given by 

fi:, y) .= __ 

where c is a normalizing positive constant and where a and /3 are some 
positive real numbers such that a(d > 2 . Set Ax>{x,y) := {{u,v) ^'D:u> x 
or V > y}, and define p := (a/3 — 2 )“^. One can check that for j = 1, 2 ,3, 


Tjix,y) = 


ATi{x,y) 


ccj du dv 

||(n,n)^||“(/3+i-i)’ 


where ci = 1, C 2 = —jS and C 3 = /3(/3 + l)/2. The functions M and N are 
homogeneous with order given through p = p' = —ap. 

Let us discuss some particular choices of the norm: 


- For the L^-norm and a = 1, the model coincides with the bivariate Pareto 
of type II distribution, denoted by BPII(/3) in this paper, and referred to as 
MpO)(//)(o, l,/3 —2) in Kotz, Balakrishnan and Johnson (2000), page 604. 
In this case, p = q = {(3 — 2)“^, and L{x, y) =x + y — {x~p + y~P)~^/P. The 
latter s.t.d.f. is known as the negative logistic model, introduced by Joe 
(1990); see also Beirlant et al. (2004), page 307. 

- When the Euclidean norm is chosen, one recovers the bivariate Cauchy 

distribution for a = 2, /3 = 3/2 and p=l. On the positive quadrant, that 
means for V = M^, we have c = 2 / 7 r, ri(u, v) = c(u“^ + and a{x) = 

b{x) = cjx. On the whole plane, which means that T) = M^, we get c = 
l/(27r), Ti{u,v) = c{u~^ + v~^ + (u“^ + and a{x) = b{x) = 2c/x. 

This can also be seen as a particular case of the following item. 
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- The Student distributions with Pearson correlation coefficient 9 arise 
choosing the norm ||(x,y)^|| = — 26xy + for a positive 

real number u, a = 2, (3 = {v + 2)/2 and p = . In this case, the integral 

form of the function Ti cannot be totally simplified, and one classically 
writes the s.t.d.f. as 


L{x,y) = {x + y) 


y „ /T-pr 


+ 


x + y 


-F, 


u+l 


\ vrv# 


where is the c.d.f. of the univariate Student distribution with u + l 
degrees of freedom. This dependence structure is also obtained for some 
elliptical models; see, for example, [Krajina (2012), page 1813] and next 
subsection. 

- Other choices for the norm would lead to other distributions. Note that one 
can also relax the symmetry condition, considering, for instance, the Ma- 
halanobis pseudo-norm dehned by ||(x,y)'^|p = (x/cr)^ — 2p{xja\yj t) -|- 
(y/r)^ for a real number p such that \p\ < 1 and some positive real num¬ 
bers a and r. 


4.2. Elliptical distributions. Consider the usual representation of the cen¬ 
tered elliptical distribution = RAU, in terms of a positive random 

variable i?, a 2 x 2 matrix A such that 5] = AA^ is of full rank, and a 
bivariate random vector U independent of R, uniformly distributed on the 
unit circle of the plane. Assume that R has a probability density function 
denoted by qr. One can then express the probability density function of 
(A, Y) as 

fix,y) ■■= j^^^^gR{{x,y)^~\x,yf}. 

A sufficient condition to satisfy (7) is to assume that the distribution of R 
belongs to the Hall and Welsch class [Hall and Welsh (1985)], namely, 

r{R >r) = cr-i/T'{l + DirP/^ + + o{r^P+P^^/'^)}, 

with positive real c, nonnull reals Di and D 2 and negative reals p and pi. 
One can check that, for j = 1,2,3, 

j, j s _ c rr _ dudv _ 

^ 27r7ldet A] J J u>x or v>y} {{u,v)'E-^u,v)^y+^F2Y+Pj ’ 

where pi = 0 ,p 2 = -p/{‘^l) and p^ = -{p + pi)/{2'y). 
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Assuming for simplicity that 5] = Q the s.t.d.f. can be written as 


L{x,y) = {x + y) 


y p f {y/xV - G /ttt-t] 

VV7 + ij 




+ y 




which is the form already obtained for the Student distribution in Section 4.1 
for u = 1 / 7 . See Demarta and McNeil (2005) for more details. Note finally 
that for a general matrix S and the special case gR{r) = c(l + one 

recovers the Mahalanobis pseudo-norm already mentioned in the previous 
subsection. 

When dealing with margins that are not heavy tailed, the calculus is done 
directly from ( 6 ). The last two examples of bivariate distributions have short 
and light tailed margins, respectively. 


4.3. Archimax distributions. Consider the bivariate c.d.f. defined for each 
0 < u, u < 1 by 

(19) F{u,v) = {1 + L{u~^ - l,v~^ - 1)}“\ 

given in terms of a s.t.d.f. L. This distribution has standard uniform uni¬ 
variate margins and corresponds to a particular case of Archimax bivariate 
copulas introduced in Caperaa, Fougeres and Genest (2000), in which the 
function (fiit) = t~^ — 1 is the Clayton Archimedean generator with index 1. 
Expanding the left-hand side term of (6) leads to, as t tends to infinity, 

t{l - F{1 - t~^x, 1 - t~^y)} = L{x,y) + t~^M{x,y) + t~‘^N{x,y) + o(f“^), 

where 

M{x,y) := x‘^diL{x,y) + y‘^d 2 L{x,y) - L‘^{x,y), 

N{x, y) := x^/2dliL{x, y) + x^y'^dl 2 L{x, y) + //25|2T(x, y) 

+ y) + {x^ - 2x‘^L{x, y))diL{x, y) 

+ {y^ - 2y‘^L{x,y))d2L{x,y). 

This allows us to identify p = p' = —1. Above, the notation dijL stands for 
d‘^L/{dxi dxj). 

4.4. Multivariate symmetric logistic distributions. Consider the c.d.f. de¬ 
fined by 


(20) 


F(x,y) = exp{-(e-"/^ + e-^/^)*}, 
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for each G M, which corresponds to the bivariate extreme value distri¬ 
bution with Gumbel univariate margins -Fi(x) = F 2 {x) = exp{—e“*} and 
symmetric logistic s.t.d.f. L{x,y) = + 2/^^*)*, where 0 < s < 1. This dis¬ 

tribution was introduced in Tawn (1988); see, for example, Beirlant et al. 
(2004), page 304. Expanding leads 

to 

L{x,y) + t~^M{x,y) + t~‘^N{x,y) + o{t~^), 

where 

M{x,y) := -Lyy^/*){L(x,y)}^"^/^ - l{L{x,y)}^, 

N{x,y) := -Ly^ 2 /^/^){L(x,y)}^"^/® 

+ - yf{L{x,y)}^~'^^" 

This allows us to identify p = p' = —1. The identification of second and 
third-order terms has previously be derived by Ledford and Tawn (1997). 

5. Finite sample performances. The purpose of this section is to eval¬ 
uate the performance of the estimators of L introduced in Section 3. For 
simplicity, we will focus on dimension 2, and simulate samples from the dis¬ 
tributions presented in Section 4. Thanks to the homogeneity property, one 
can focus on the estimation of 1 1 —)• L(1 — t,t) for 0 < t < 1, which coincides 
with the Pickands dependence function A; see, for example, Beirlant et al. 
(2004), page 267. Considering first the estimation at t = 1/2 leads to the 
definition of aggregated versions of our estimators. These new estimators 
will be both compared in terms of L^-errors for L or associated level curves. 

5.1. Estimators in practice. Let us start with the estimation of L{lj2, 1/2) 
for the bivariate Student distribution with 2 degrees of freedom. This model 
is a particular case of Sections 4.1 and 4.2. For one sample of size 1000, 
Figure 1 gives, as functions of k, the estimation of L at point (1/2,1/2) by 

/V o 

Lfc, Lk and L^, respectively, defined by (2), (15) and (17). For the last two 
estimators, the parameters have been tuned as follows: a = 0.4, kp = 990 and 
p estimated using (22) with a = r = 0.4. These values have been empirically 
selected based on intensive simulation, and will be kept throughout the pa¬ 
per. One can check from Figure 1 that the empirical estimator Lj^ behaves 
fairly well in terms of bias for small values of k. Besides, the bias is efficiently 
corrected by the two estimators and L^. Since the bias almost vanishes 
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Estimation of L(1/2,1/2) for ths Studsnt(2) modal and n=1000 



Eig. 1. Estimation o/L(l/2,1/2) for the bivariate Student(2) law based on a sample of 
size 1000. 

along the range of k, one can think about reducing the variance through an 

o — 

aggregation in k (via mean or median) of or L^. This leads us to consider 
the two following estimators: 

o o 

Lagg := Median(Lfc, k = l,..., «:„), 

Lagg := Median(Lfc, k = l,..., 

where n is the sample size and Kn is an appropriate fraction of n. Their 
performance will be compared to those of the family {Lk,k = 1,..., n — 1}. 
Simplihed notation {Lk,k} will be used instead of {Lk,k = 1}. 

Because any s.t.d.f. L satishes max(t, 1 — t) < L{1 — t,t) < 1, the competitors 
have been corrected so that they satisfy the same inequalities. 

Remark 10. If ftn satisfies the condition imposed on k^ in Theorems 3 

o — 

and 4, then the aggregated estimators Lagg and Lagg would inherit the 

o ~ 

asymptotic properties of Lfc and L^. Indeed, all the estimators jointly con¬ 
verge, since they are based on a single process. 

Remark 11. In the following simulation study, is arbitrarily fixed 
to n — 1 . Such a choice is open to criticism since it does not satisfy the the¬ 
oretical assumptions mentioned in the previous remark. But it is motivated 
here by the fact that the bias happened to be efficiently corrected, even for 
very large values of k, as already illustrated on Figure 1. Note, however, 
that such a choice would not be systematically the right one. In presence 
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of more complex models such as mixtures, tin should not exceed the size 
of the subpopulation with heaviest tail. To illustrate this point, take, for 
example, the bivariate c.d.f. F = pG + (1 —p)H, where G is the c.d.f. of the 
bivariate BPII(3) model, and H is the uniform c.d.f. on [0,1]^. Then the 
s.t.d.f. is L{x, y) = x + y — {l/x + l/y)~^ , and only p% of the data belong to 
the targeted domain of attraction, so should not exceed pn. 

Classical criteria of quality of an estimator 6 of 9 are the absolute bias 
(ABias) and the mean square error (MSE) defined by 

1 ^ 

ABias=-j;|0W-0|, 

i=l 

1 ^ 

i=l 

where N is the number of replicates of the experiment and 0^*^ is the es¬ 
timate from the zth sample. Note that what we call Ahias is also referred 
as MAE (for Mean Absolute Error) in the literature. Figure 2 plots these 
criteria in the estimation of L(l/2,l/2) for the bivariate Student(2) model 
when n = 1000 and N = 200. Figure 2 exhibits the strong dependence of the 
behavior of in terms of A:, as well as the efficiency of the bias correction 

o ^ 

procedures. The estimator given by (15) outperforms the estimator 
defined by (17), no matter the value of k. Moreover, the ABias and MSE 

o 

curves associated to Lk almost reach the minimum of those of Lk- Finally, 

Absoluts Bias oT L(1/2,1/2) for the Stu<lent(2) model end n^tOOO MSE of L(1f2,ia) Itor the Student(2) model end n^ 000 




Fig. 2. (a) ABias, (b) MSE for the estimation of L{ll2,ll2) in the bivariate Student(2) 

model when n = 1000 as a function of k. 
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the aggregated version Lagg answers surprisingly well to the estimation prob¬ 
lem of the s.t.d.f. L. First, its performance is similar to the best reachable 
from the original estimator Second, it gets rid of the delicate choice of 
a threshold k (or would at least simplify this choice; see Remark 11). These 
comparisons have also been made for five other models obtained from Sec¬ 
tion 4. The results are very similar to the ones obtained for the bivariate 
Student(2) distribution and are therefore not presented. 


5.2. Comparisons in terms of -error for L. The comparisons are now 
handled not only at a single point, but for the whole function using an 
-error defined as follows: 


( 21 ) 


1 


T 

T + l'^ 

t=i 


L 1 - 


t t 
T’T 


-L 1 - 


t t 
T’T 


where T is the size of the subdivision of [0,1]. Figure 3 gives the boxplots 

O ^ /s 

based on = 100 realizations of Tagg,Tagg and {Lk,k} for T = 30 in the 
case of six bivariate models: 


• First row: Cauchy and Student(2) models; 

• Second row: BPII(3) model and Symmetric logistic model with s = 1/3; 

• Third row: Archimax model with logistic generator L{x,y) = + 

and mixed generator L{x,y) = (x^ -\-y‘^ xy)/{x -\-y). 

o 

As already observed in Figure 2, the estimator Lagg is again very com¬ 
petitive compared to the best element of {Lfc,fc}, no matter the choice of 
model. Recall that the value of k leading to the best Lj^ depends crucially 
on the model and is consequently unknown in practice, which invites any 
users to apply this new procedure. 

-- o 

The estimator Lagg is dehnitely less competitive compared to Tagg. Given 
these results we will not pursue with the Lagg estimator in the rest of this 

o 

paper, and will focus our attention on the behavior of Lagg. 


5.3. Comparisons between Lagg, a eonvex version of Lagg, and Peng’s 
estimator. A natural step is now to compare the performance of our best 

o 

estimator Lagg with an existing competitor, recently introduced by Peng 
(2010). In his work, Peng provides a data-driven method which chooses the 

o 

threshold via estimating a s.t.d.f. Another interesting task is to compare Lagg 
with a convexified version of itself, since any s.t.d.f. is a convex function; see, 
for example, Beirlant et al. [(2004), Section 8.2.2] or de Haan and Ferreira 
[(2006), Section 6.1.5]. Note that a general convexification procedure has 
been proposed in dimension 2 by Fils-Villetard, Guillou and Segers (2008); 
see also some alternative suggestions in Bucher, Dette and Volgushev (2011). 
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LI-«fror Of L for ItM Cauchy iiMKMwtMn n«1000 



£. 


L1-orror of L for tho BPIlfS) nwxM whon 0*1000 



L1-ont>r of L lor tho Logl»tlc^2)-Archlinax modol whon ii*1000 


L1-«rror of L for tho Studont(2) moflol whon nolOOO 



U. 

Ll-orror of L for tho Symmotrtc fogiotic modol whon n*1000 



U 

Ll-orror of Lfortho Mb(od(1)-Archimax mo<M whon r*1000 




- O ~ /V 

Fig. 3. Boxplot of the L -error of function L for the estimators Lagg,Lagg and {Lk,k}. 
First row: bivariate Cauchy model (left) and bivariate Student(2) model (right). Second 
row: bivariate BPII(3) model (left) and bivariate Symmetric logistic model (right). Third 
row: bivariate Archimax model with logistic (left) and mixed generator (right). 


















18 


A.-L. FOUGERES, L. DE HAAN AND C. MERCADIER 


In order to take maximal advantage from this simulation study, the three 
different models implemented have been considered in two versions for each: 
the first model is the Gaussian one, simulated with Pearson’s correlation 
coefficient ±0.5. The Gaussian model is a particular case of elliptical dis¬ 
tributions (see Section 4.2), which illustrates the asymptotic independent 
situation; cf. Remark 4. The second model is the bivariate Symmetric logis¬ 
tic one, introduced in Section 4.4, with two different strengths of dependence 
(close to independence on the left column, stronger dependence on the right 
column). The third model is the bivariate Student family, introduced in 
Sections 4.1 and 4.2 as a particular case. Two strengths of dependence have 
also been chosen, close to asymptotic independence on the left column and 
stronger dependence on the right column. 

Our results, summarized in Figure 4, will thus exhibit in particular how 
the performance in the estimation of the s.t.d.f. depends on the distance to 
the asymptotic independence case. The y-axis scale has been fixed for all the 
six cases so that one can measure that the estimation of the s.t.d.f. is a more 
ambitious problem under asymptotic independence. However, our estimator 

o 

Lagg has still nice properties when comparing it to the empirical estimator 
Lk- 

o o 

The convex version Taggc performs quite equivalently as Tagg- A reason 
for this is that by construction our estimator is actually not far from a 
convex function. So balancing the cost of convexifying with the benefit in 

o 

the performance motivates the simple use of Tagg- 

Finally, regarding Peng’s estimator Lp, one observes that this estimator 
is an interesting alternative to the original family {Lfc,A:}, which, however, 
never outperforms our proposal. 

5.4. Estimating a failure probability. Let us illustrate in this subsec¬ 
tion the question of estimating an arbitrarily chosen failure probability 
P{X > 10^ or F > 2 • 10^), where {X,Y) comes from the BPII(3) model, 
so that P{X > 10^ or y > 2 • 10^) = 0.00011665. Samples of size n = 1000 
are considered. Thus empirical estimation will be useless for evaluating the 
probability of exceeding such extreme values for X or Y, and an extrapola¬ 
tion based on extreme value theory is thus needed. 

First assume that it is known that the margins are standard Pareto. This 
probability can be approximated by 

P{X > 10%r y > 2 • 10^) ~ (10"^ ± 5 • 10"®)L(2/3,1/3), 

which naturally comes from (1), the projection on the simplex and the ho¬ 
mogeneity of L. Estimating the unknown parameter L(2/3,l/3) with our 

O /S 

candidate Lagg and the original family {Lk,k} gives several boxplots (based 
on 500 replicates) that are presented in Figure 5. The comparison of these 

o 

estimates is again favorable to Lagg. 
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L1-«rror of L for th« Gaussiaii(0.5) modol whon n«i000 



c. 


Ll-orror of L for tho Symmotrtc logisllc^l^ motM wtwn n»1000 


L1-«rror of L for tho Gaussiart(-0^ mocM whon n«1000 



U 

L1-orror of L for tho Symmotric loglstlc(3) modol whon n^lOOO 




t L 

L1-ofTor of L for tho Studont{20) model when hbIOOC L1-error of L for the $tudent(2) model when n«1000 




- o y ^ 

Fig. 4. Boxplot of the L -error of function L for the estimators Lagg, Laggc, Lp and 
{Lk,k}. First row: bivariate Normal model with correlation t: t = 0.5 (left) and r = —0.5 
(right). Second row: bivariate Symmetric logistic{s) model: s = 1/1.2 (left) and s=l/3 
(right). Third row: bivariate Student(p) model: v = 2Q (left) and v = 2 (right). 
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Eig. 5. Boxplot (based on 500 replicates) for the estimation of P(X > orY > 2 • 10'*) 
when {X, E) is drawn from the BPII(3) model with sample size n = 1000 and assuming 
margins to be known. 


Remark 12. We also investigated the possible use of a second-order 
term in the approximation of the probability P{X > 10^ or R > 2 • 10^), 
making use of the following estimators 

(10-^ + 5 • 10-5)Lagg (I + Q) Vo-' + 5 • 10-5)'-^A, (I . 

The results were so similar to those obtained in Figure 5 that we chose to 
skip them. 

Second, when the margins are not assumed to be known, the estimation of 
pi = l — Fi(lO^) and p 2 = I — F2{2 ■ 10'^) can be reached by the POT method 
[see, e.g., Beirlant et al. (2004), Section 7.4] for several values of a threshold. 
After the study of mean residual life plots and quantile plots, the thresholds 
have been fixed to be X^-k^n and for k = 200. The POT estimates 

deduced with these thresholds are, respectively, denoted by pi and p 2 - The 
probability P{X > 10^ or T > 2 • 10^) is then approximated by 

P{X > 10%r y > 2 • 10^) ~ (pi +P 2 )l( V\ , 

\Pi +P2 Pi +P2 

Estimating on each repetition the unknown parameter L{pi/{pi +P 2 )^P 2 l{pi + 

O /V 

P 2 )) with our candidate Lagg and the original family {Lk,k} gives several 
boxplots (based on 500 replicates) presented in Figure 6. It seems clear that 
the uncertainty on the margins Fi and F 2 has much more influence than 
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BPI(3) model with nelOOO 



Fig. 6. Boxplot (500 replicates) of the estimation of P{X > 10'* or Y > 2 ■ 10*) when 
{X, Y) is drawn from the BPII(3) model with sample size n = 1000 and estimating margins 
by POT method. 

that of the s.t.d.f. L. Such findings corroborate previous studies; see, for 
example, Bruun and Tawn (1998) and de Haan and Sinha (1999). 

5.5. Q-curves. Another nice representation of a function of several vari¬ 
ables is through its level sets. In the case of the function L, it consists of 
looking (for any positive real c) at sets of the form {(x, y) G L(x, y) <c}. 
From homogeneity property, it is characterized by 

Q := {{x,y)€M.l,L{x,y) < 1}. 

Following de Haan and Ferreira [(2006), page 245], the boundary of this set 
can be written as 

dQ = {{b{9) cos 9, b{9) sin6) : b{9) = {L{cos9,sin9))~^,9 G [0,7r/2]}. 

The estimation of dQ is naturally obtained by replacing L by any estimator, 

o ^ 

and this is done here for the estimators Tagg and {Lk,k}. Figure 7 (left) ex¬ 
hibits the bias phenomenon (as k increases) induced by in the estimation 
of the Q-curve. The bias factor on Lfc is illustrated with k = 50, k = 100 and 

o 

k = 800. The correction of the bias with Tagg is effective. As in the previous 
section, the comparison of the different estimators is provided in terms of a 
global criterium based on the L^-norm, given by 
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Estimation of ths Q-curvo for ths Stud6nt(2) modal whan n^l 000 



Q-curva for wava halghts data 


-|-1-1 I 

0.2 0.4 0.6 0.6 

HmO 


Eig. 7. Left: Estimation of the Q-curve for the bivariate Student(2) law based on a 
sample of size 1000. Right: Estimated Q-curve for the wave heights data introdueed in 
de Haan and Ferreira (2006). 

Figure 8 displays the boxplots of this measure, based on N = 100 realizations 
and for T = 30 under the six bivariate models given in the previous section. 

The estimation of the Q-curve based on the original estimator is 
strongly sensitive to the choice of k: the bias (resp., the variability) is an 

o 

increasing (resp., decreasing) function of k. The performances of Tagg is 
similar to that of the best Lj., which is nnknown in practice. These featnres 
corroborate the conclnsions drawn in Section 5.2. 

To close this section, let us illustrate the Q-curve estimation on the wave 
heights data set of de Haan and Ferreira (2006), page 207. As explained 
therein, wave height (HmO) and still water level (SWL) have been recorded 
during 828 storm events on the Dutch coast. The analogous Figure 7.2 from 
de Haan and Ferreira (2006) is reported in Figure 7 (right). Even if the 
two curves are not so close, the conclusion remains the same: the estimated 
boundary is concave, which indicates that the high values of the two variables 
are dependent. 

6. Estimation of second-order components p and M. In this section, 
we focus on the estimation of the function M coming from the second-order 
condition (6) and on the estimation of its homogeneity parameter 1 — p. 

6.1. Second-order parameter p. A possible way to estimate p is to use on 
each margin one of the techniques developed in the univariate setting; see, 
for example. Gomes, de Haan and Peng (2002) or Ciuperca and Mercadier 
(2010). Other methods make use of the multivariate structure of the data; 
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L1-«m>r of Q for the BPII(3) modal when n«1000 




100 200 300 400 s00«00 700 800ft00 

L 



- O -s 

Fig. 8. Boxplot of the L -error of Q-curve for the estimators Lagg and {Lk,k}. First 
row: bivariate Cauchy model (left) and bivariate Student(2) model (right). Second row: 
bivariate BPII(3) model (left) and bivariate Symmetric logistic model (right). Third row: 
bivariate Archimax model with logistic (left) and mixed generator (right). 
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see, for example, Peng (2010) and also Goegebeur and Guillou (2013) in a 
slightly different framework. The construction described here takes likewise 
advantage of the multivariate information of the sample. With this purpose, 
the following proposition shows that a variable of interest is the ratio of two 
terms Ak^a, dehned by ( 11 ). 

Proposition 5. Assume that the conditions of Proposition 1 are ful¬ 
filled and fix positive real numbers r and a G ( 0 , 1 ). Assume moreover that 
the function M never vanishes except on the axes. Then, as n tends to in¬ 
finity, for every e > 0 and T > 0, 

sup 

e<xi,...,Xd<T 

Remark 13. If the requirement that the function M is either positive, 
or negative in the positive quadrant does not hold, one could consider the 
integral of (Afc^a(x))^ over the set {x = (xi,... ,Xd) s.t. + • • • + = 1} 

and prove a result like Lemma 7 for this statistic. Then the integral of 
appears in the denominator in Proposition 5 instead of M itself, and the 
sign of M does not matter. This will be part of a future work. 


Afc,a(Tx) 


At 


— r 


i-p 


X 


A family of consistent estimators of the parameter p can be derived from 
Proposition 5. 


( 22 ) 


Pk,a,r{,^) ■— ( 1 


1 


logr 


log 


Ak,a{rx) 


Afc,a(x) 


AO. 


The following property can be obtained from the asymptotic expansion given 
in Proposition 2. 


Proposition 6. Assume that the conditions of Proposition 2 are ful¬ 
filled, and fix positive real numbers r and a G (0,1). Consider the estimator 
of p defined by (22). Assume moreover that the function M never vanishes 
exeept on the axes. Then, as n tends to infinity, 

{Pfe,a,r(x) ~ p) ^ Zp^a,r{x), 



in D{[e,T]'^) for every e > 0 and T > 0, with 

^ __ a~^ZL{ax) - Zl{x) _ g-^Ziirax) - Zijrx) 

(a“^ — l)M(x) logr (a“/’— l)M(x)ri“/’logr 
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Estimation of p for n=1000 



(®) (b) (c) (d) (•) 

Fig. 9. Boxplot of 500 estimations of p given by (22) using samples of size 1000 drawn 
from six models: (a) Student(2); (b) BPII(3); (c) Symmetric Logistic with s = 1/3; 
(d) Archimax model with logistic generator with s= 1/2; (e) Archimax model with mixed 
generator. Red line indicates the true value of p = —1. 

Figure 9 illustrates the finite sample behavior of this estimator of p for 
a collection of bivariate models introduced in Section 4, for which the true 
value of p is equal to —1. These boxplots show that the estimator performs 
reasonably well in median, no matter the choice of model, but the uncer¬ 
tainty is rather important. Fortunately this seems from simulation studies 
to have only minor influence on the estimation of L. 

6.2. Second-order function M. Recall that from (12) the asymptotic bias 
of Lfc^(j(x) is given by a(^)a“^M(x). In order to circumvent an estimation 
of the term a{n/k), a renormalization is needed, focusing, for instance, on 
the estimation of M(x)/M(l/2) where 1/2 = (1/2,..., 1/2). Thanks to (13), 
this ratio can be consistently estimated by 

Afc,a(x) 

Afc,a(l/2) 

as soon as A; is a well-chosen intermediate sequence. The asymptotic normal¬ 
ity can also be derived from analogous arguments to those used in the proof 
of Proposition 6. Details are not presented here for the sake of simplicity. 

Figure 10 summarizes the behavior of the estimator of the curve 1 1 —>■ 
A4(t, 1 — t)/M(l/2,1/2) through boxplots of the L^-error, defined as in (21). 
We observe from this figure that the best estimation is reached for large 
values of k. This feature does not depend on the degree of asymptotic de¬ 
pendence in the Symmetric logistic model, nor on the strength of the bias of 
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LWrror of M(.)/M(iai/2) for the Symnwtric logistic model with r-0.1 L1-srror of l•(.)/M(1/2,1/2) for tho Symmetric logistic model with r^.S 


10 90 170 260 330 410 490 670 650 730 610 890 970 


L1-srror of M(.)/M(1/2,1/2) for the Symmetric logistic model with r«0.9 Ll-error of M(.)M(1/2.1/2) for the Mlxed<1)*‘Archlm«x model 
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Fig. 10. Boxplot of the -error of M{-) jM{If 2,112)-curve. First row: bivariate logistic 
model with s = 0.1 (left) and with s = 0.5 (right). Second row: bivariate logistic model with 
s = 0.9 (left) and bivariate Archimax with mixed generator (right). 


the original estimator detected on Figure 3. These graphs conhrm that 
the asymptotic bias is remarkably well estimated for large values of k. This 
helps to understand why the bias subtraction is accurate for large or very 
large choices of k, as also commented in Section 5.1. 

7 . Concluding comments. This paper deals with the estimation of the 
extremal dependence structure in a multivariate context. Focusing on the 
s.t.d.f., the empirical counterpart is the nonparametric reference. A common 
feature when modeling extreme events is the delicate choice of the number 
of observations used in the estimation, and it spoils the good performance 
of this estimator. The aim of this paper has been to correct the asymptotic 
bias of the empirical estimator, so that the choice of the threshold becomes 
less sensitive. Two asymptotically unbiased estimators have been proposed 
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and studied, both theoretically and numerically. The estimator defined in 
Section 3.2 proves to outperform the original estimator, whatever the model 
considered. Its aggregated version defined in Section 5.1 appears as a worthy 
candidate to estimate the s.t.d.f. 


8. Proofs. 


Proof of Proposition 1. Denote by the uniform random vari¬ 
ables = 1 — for j = 1,... ,d. Introducing 

1 " 

^{C/P^<fca:i/n or ... or <kx^/n} 

2 = 1 


allows us to rewrite as the following: 


Write 


n(x) = i4( 


[kxd],n 


Lfc(x) - L(x) 


= Vk{TU}^^ 


^ [fcxi],n’ ■ ■ ' ’ [kxd]; 


-\ 

Ti r 


- fii - , F7‘(i - ys,,.„))i 


r('^) 




k 


[/call],?!'' 


lid) 

[kXfi ],?!'' J J 


— T ( 

^ [fca;i],n’■ ■ ■ ’ ^ [fcxrf],n 




—Tji^) '^LTji'^) ^ _ rixl 


and denote ^i_fc(x) [resp., T 2 ,fc(x) and T 3 ^fc(x)] the first line (resp., second 
and third lines) of the right-hand side. 

Applying de Haan and Ferreira [(2006), Proposition 7.2.3] leads to 

v'fcAi_fc(x) -A ITl(x), 

in D([0, T]"^) for every T > 0 and for any intermediate sequence, where Wl is 
a continuous centered Gaussian process with covariance structure specified 
in Proposition 2. Due to the Skorohod construction we can write 


(23) 


sup |\/feAi,fc(x) - ITl(x)| 0 

0<aii,...,aifi<T 


a.s., 
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which implies, since y/ka{n/k) —)• oo, 


sup 

0<x\,...,X4<T 


a{n/k) 


= Of 


Vka{n/k) J 


Again for any intermediate sequence, the proof of de Haan and Ferreira 
[(2006), Theorem 7.2.2] ensures the convergence for j = 1,... ,d 


(24) 

and finally 

(25) 


sup 

a:e[0,T] 




0 


a.s.. 


sup 

0<xi,...,Xd<T 


VfcA3,fc(x) + ^ WL(a;jej)5jL(x) 
As previously, this yields 


a.s. 


sup 

0<xx,...,Xd<T 


^3,fc(x) 


a{n/k) 


= o 


1 


Vka{n/k) 


Since the intermediate sequence satisfies Vka{^) —>■ oo, it thus remains to 
prove that 


sup 


-M(x) 


0 


a.s. 


a{n/k) 

The second-order condition that holds uniformly on [0,T]‘^ in (6) yields 


sup 

0<xi,...,Xd^<T 


^2,fc(x) 


a{n/k) 

Then the result follows from 


-M -a 


n. 


r(l) 


^U^d) 

o • • • > 


^ [A:xi],n’ ■■■’)!(; [kxd\,n 


0 


sup 

0<xi,...,Xd<T 


n. 


M(x) - M ( 1 ,..., 1 


a.s. 


a.s.. 


which is obtained combining (24) and the continuity of the function M. □ 

Proof of Proposition 2. We use the notation introduced in the 
proof of Proposition 1. Thanks to the Skorohod construction, we can start 
from (23). Combined with (25), it is sufficient to prove the convergence 


sup 

0<x^,...,X4<T 


v^|^2,fc(x) -a^^^M(x) 


0 


a.s. 


Note that the third-order condition, the uniformity on [0,T]'^ of the conver¬ 
gence in (7) and the continuity of N yield 


n 


A2,fc(x)=a( - 1M( -U, 


n , 


r(l) 




[kx{\,n'’ ' ' ' ’ [kxci\,n 


n 


Op a - /3 - 


n 
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Thanks to (24) and to the existence of the first-order partial derivatives djM 
(j = 1,..., d) of the function M, we have that 


sup 

0<xi,...,Xd^T 




+ ^ WLixjej)djM{x) 
i=i 

converges to 0 in probability, as n tends to infinity. This implies that 


^/t|.4J.,W-a(^)Mw}|=0,(|^/ta(^)/3(^)+a(^) 

which completes the proof, thanks to the choice of the intermediate sequence. 

□ 


sup 

0<3;i ,...,Xd^T 


Proof of Theorem 3. Recall that b = {a ^ -I- 1) and denote b = 
{a~P + . Write 

(26) Lk^a,kp — L = {Lk a ~ L/} + {Lfc — L} — g — T}, 

which equals, thanks to (12) and under Skorohod’s construction, 

{a~P + l)M(x) -L ■^(a“^ZL(ax) -L ^l(x)) 

-a(^)r-M(x)-(^Z,(i,x) + »(T) 

= ((o-f + 1) - b-nM{x) + Ty;(x) 

+ „(=)(i,-»._i,-0M(x) + o(T) 

= ((o-f + 1) - b-nM{x) + Ty;(x) 

The first term is zero. Since both k = o{kp) and a is regularly varying with 
negative index, the only the last term can be put into the term o(-^). 
Finally, the covariance function follows from the equality in law as processes 
between Zi(ox) and y/aZL{^). □ 

The proofs of Theorem 4 and Proposition 6 are based on the following 
auxiliary result. 
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Lemma 7. Assume that the conditions of Proposition 2 are fulfilled. 
Then for any positive real r, one has as n tends to infinity, 


Vki 


a 


n\ f Ak,a{rx) 


J { a{n/k) 
in D([0,T]‘^) for every T > 0. 


- {a-P - l)r'--PM{x) 


A a ^ZL(rax) - ZL(rx), 


Proof of Lemma 7. Making use of the homogeneity of the function 
L, write 

Afc,a(?’x) = {Lfc,a(rx) - L(rx)} - {Lfc(rx) - L(rx)}. 

Using the Skorohod construction, it follows from equations (8) and (12) that 


sup 

Q<xi,...,Xd<T/r 


\kJ [ a{n/k) 


- {a-P - l)r^-^M(x) 


-a Ziirax) + ZL{rx.) 


tends to 0 almost surely, as n tends to inhnity. □ 
Proof of Theorem 4. Note that 

. Afc ^„(ax) . Afc ^„(x) 

Lfc(x)— -Lfe(ax)- 


a{n/kp) 
= Lk{-x) 


a{n/kp) 

^k„,a{aA ^kn,aA) 
— a- 


— a- 


^kp,aiA) ^k,aiZ~) 

a{n/kp) 


a{n/kp) a{n/kp) 

Under a Skorohod construction, Lemma 7 allows us to write the expansions 
of the terms Afc^a(x), Afcp^a(x) and Afc^^a(ax), which implies on the one hand 

^fcp,a(®x) Afc (j(x) 

— a 


(27) 


a{n/kp) a{n/kp) 
= a{a~P — l)^M(x) 
1 


+ 


+ o 


y/Vpa{n/kp) 

1 


{a ZL(a x) - 2Zi(ax) + aZi(x)} 


\AA>An/kp) 


and 


(x)A 


k,a 


X 


a{n/kp 


= a{n/k){a ^ —l)^M^(x) 
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(28) 


+ (a"^-l)M(x) 


a - Zl{-x) 


y/k 


+ 


Opf 


a{n/k) 


+ 


\^/^a{n/kp) y/k-^^pain/k 


+ o 


(it 


on the other hand, both uniformly for x G [e,T]'^. Combining (27) and (28) 
with equation (8), one gets 

f , <^fcp,a(o.x) - Afc Q,(x) 

Lfc(x)— -hfc(ax) 


a{n/kp 


a{n/kp 


= a{a ^ — l)^M(x)L(x) H— —M{:x.){a ^ —l)(a^ ^Zi(x) — Zi(ax)) 

V ^ 


+ 


y/¥pa{n/kp) 


L(x){a ^^(a x) - 2ZL(ax) + aZi(x)} 


+ o 


4 +0 


VkJ \y%a{n/kp) 


Since the last expression and equation (27) are, respectively, the numer¬ 
ator and denominator of Lfc fcp^a(x), one obtains, after simplification, 

/T/f / \ ^ a~^Zliy.) — a~^Zliay.) ( \fk \ 

a P -1 \y/kpa{n/kp)J 

since M does not vanish by assumption. The choice of the sequences k and 
kp allows us to conclude since \/fc = 0{A^p(^{n/kp)). □ 


Proof of Proposition 5. Applying Lemma 7, we have 

Afe,a(x) 


(29) 

As a consequence, 


sup 

e<x\^...^Xd'^T 


a{n/k) 


- {a-P-l)M{x) 


0 . 


sup 

6<Xl,...,Xd<T 


Afc,a(l’x) 


— r 


= sup 

£<Xi,...,Xd'^T 


^k,aA 

Afc,a(px)/a(n/A:) 


Afc,a(x)/a(n/A:) 


sup 


a{n/k) 


— r 


_ ^1-p 


l-p^k, 

2 A 


a{n/k) 
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since (a ^ — l)M(x) 7 ^ 0 by assumption. Writing 


Afc,a(?’x) 


a{n/k) 


< 


Afc,a(?’x) 


a{n/k) 


a{n/k) 


-r^-P{a-P 


+ 


r^-P{a-P - l)M(x) - 


ainjk) 

and using twice equation (29) leads to the conclusion. □ 


Proof of Proposition 6 . Define Qkari^) ■= ■ Lemma 7 used 

twice yields 

(30) Vka(^^ {Qk,aA^) - logrZp,a,r(x), 

where Zp^a,rA) Is defined in Proposition 6 . Since A,a,rA) — 1 “log(Qfc,a,r(x))/log 
the result follows straightforwardly from (30) and the Delta method. □ 
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